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Abstract — Linear codes for error detection on a g-ary sym- 
metric channel are studied. It is shown that for given dimension 
k and minimum distance d, there exists a value ^,{d,k) such 
that if C is a code of length n > ii{d,k), then neither C nor 
its dual are good for error detection. For d » fc or fc S> rf 
good approximations for fi{d, k) are given. A generalization to 
non-linear codes is also given. 

I. Introduction 

The q-ary symmetric channel with symbol probability p, 
where < p < ^y!-, is defined as follows: symbols from 
GF{q) are transmitted over the channel, and 



P{b received | a sent) 



1-p 
p 

9-1 



b ~ a 



Suppose that a g-ary linear [ti, fc, d] code C is used for error 
detection for transmission over the g-ary symmetric channel 
with symbol error probability p. The probability of undetected 
error for C is denoted by Pue(C', V)- For most codes we are not 
able to determine the value of Pue{C,p) exactly. Therefore, 
it is useful to have estimates. C is called good (for error 
detection) if 

Puc(C,p)<Puc(C,(q-l)/q) = ^^. (1) 

Only some codes have this property. The purpose of this paper 
is to show that, for given dimension k and minimum distance 
d, there exists a value p{d, k) such that if n > p{d, k) and C 
is an [n, /c, d\ code, then neither C nor its dual are good 
for error detection. Further, for d » fc or fc ^ d we give good 
approximations of k). At the end of the paper we give 
similar results for non-linear codes. 

II. Existence of ^{d, k) 

If Ai, i = 0, is the weight distribution of C, 

then the probability of undetected error for C and on 
a symmetric channel with symbol error probability p is given 
by 



Puc(C,p) = ^A,(-^)\l-p)" 

1 y 



(2) 



i=0 



where Q = q/{q — 1). 

Note that C is good if Puo(C,p) < q'"'" - g"" for all 
p e (0, {q - l)/q). We call C bad if Puc{C,p) > q''-" for 



some p G (0, {q — l)/?)- Note that being bad is a stronger 
condition than not being good. We also note that most codes 
are either good or bad, but a code may be neither. 

Lemma 1: C is bad if and only if is bad. 
Proof. Suppose C is bad. By definition there is a p G (0, {q — 
l)/q) such that Puc{C,p) > q'^"". Define tt by 

P 



that is 



l-QiT 



(g - 1) - qp 

qil-p) 



(q-m-p)- 



and 1 — TT = 



g(i-p)' 



Then 



< 



PueiC,p) 



2=1 



(i 



(q-m-p). 
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(l-p)"(5]A,;(l-Q^)*-l) 

i=0 

(l-p)"((?'=Pue(C^,^) + (l-vr)"-l) 

(l-p)"(g^-Pue(C^,^)- 



+g'^(l-7r)"(l-p)" 

= (l-p)"((7'=Pue(C^,^)-l) +9*^-" 

and hence Puc(C^,7r) > q~'', that is, is bad. Since 
C^^ = C we get the if and only if. □ 

Remark 1. For q ^ 2, Lemma^is Theorem 3.4.2 part 1 in 
[1]. The proof for general q given above is a straightforward 
generalization of the proof for q = 2 given in [1]. 

Remark 2. It is not the case that C not good implies that 
is not good. 

We want to find sufficient conditions for a code to be bad. 
If the minimum distance of C is d (that is, Ai = for i = 
1, 2, . . . , d - 1 and A^ > 0), then > g - 1 by the linearity 
and hence 

PneiC^,p) > q-' + q-\q - 1)(1 - Qpf - {I - pT > 

if {q - 1)(1 - Qp)"^ > q''{l -p)'". Taking logarithms, the last 
condition is equivalent to 

\n{q - 1) + dln(l - Qp) > n\n{l - p) + kln{q). 



Hence, combining with Lemma^ we get the following lemma. 



Lemma 2: If 



n > h{p) 



rfln(l - Qp) - fcln(g) + ln(g - 1) 
ln(l - p) 

then both C and are bad. 

We will use the notations k = fcln(q) — 111(5 ^ 1)' 

-1 



ln(l - Qp) 



ln(l - p) ■ 



Then 



Hp) = d.f{p) + Kg{p). 



(4) 



Any choice of p, < p < {q — l)/q now gives a proof of 
the existence of /i(fc, d) such that if n > fi{d, k) and C is an 
[n, /c, c?] code, then neither C nor its dual are good for 
error detection. 

To get the strongest result from the lemma, we want to find 
the p that minimizes h{p). The function f{p) is increasing on 
(0, {q — l)/?), it approaches the value Q when p 0+, and 
it approaches infinity when p ^ {q — l)/q—. Moreover, 

-Q(l - p) ln(l - p) + (1 - Qp) ln(l - Qp) 



f'ip) 



and 



where 



f'ip) 



hip) 



(l_p)(l_Qp)ln(l-p)2 

IM 

-(l-p)2(l-Qp)2(ln(l-p))3' 

g2(l-p)2(ln(l-p))2 
+2Q(l-p)(l-Qp)ln(l-p) 
-2(l-Op)2ln(l-Qp) 
-(l-Qp)2ln(l-j5) In(l-Qp) 
> 

for all p E (0, ((7— l)/?). Hence / is convex on (0, ((? — 1) /q)- 
Similarly, the function g{p) is decreasing on {0,{q — l)/q), 
it approaches infinity when p 0+, and it takes the value 
— l/\n{q) for p = {q — l)/q. Moreover, 

1 l-Qp 



9'{P) = 



(1 - p) ln(l - p)2 (1 -p){l- Qp) ln(l - p)2 



9"{P) = 



-(2 + ln(l-p)) 



> 



(1 -p)2(ln(l 

for allp e (0, (g— l)/g), and so g{p) is also convex on (0, {q — 
1 )/(?). This implies that the combined function h{p) is also 
convex on (0, (q — 1)/ q) since k > 0, and it takes its minimum 
somewhere in (0, (g — l)/'z). We denote this minimum by 
k). From Lemmas[2and|2]we get the following corollary. 

Corollary 1: If n > (u((i, k), then neither C nor are 
good for error detection. 

We next find approximations for k) when d ;3> fc or fc ^ 
d. We denote by Pm the value of p where h{p) has its minimum 
(the minimum is by definition k)). 



III. Approximations for ^(d, fc) when d > fc 
Theorem 1: Assume that d — *■ cxd and fc/d ^ 0. Let 



. = fcln(,)-ln(g-l)andy .,2^Q(g_i) 

There exist numbers at and bi for i = 1,2,... such that, for 
any r > 0, 

r 



and 



(d, fc) = dQ + 2dQ(Q - 1) ^ + 0{y^+^). 



The first few a.i and 6^ are given by the following table: 

ai = 2, 

a2 - -(8Q + 2)/3, 

as = (2602 + 220-l)/9, 

04 = -(368Q^ + 708Q2 _ i2Q + 8)/135, 

bi = 1, 

62 = (2Q-l)/3, 

63 = (202_2Q-1)/18, 

64 = -(4Q3 - 6Q2 _ 6Q + 4)/135. 
Proof: First we note that k = 2dQ(Q — l)y'^ and so 

Hl-Qp)-2QiQ-l)y^ 



h{p) = d ■ 



ln(l - p) 



and 



where 



H{p,y) 



,// X , y) 

(l-p)(l-Qp)(ln(l-p))2' 



-g(l - p) ln(l - p) + (1 - gp) ln(l - Qp) 
-2Q{Q-l)y\l-Qp). 



Hence h'ijj) = if H{p, y) = 0. Taking the Taylor expansion 
of H{Y,a,y\y) we get 



HC^ary\y) 



'^{Qal + a? + 6a2 + l2Q)y^ + ■ 
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All coefficients should be zero. In particular, the coefficient 
of y^ shows that a\ — 4. Since aiy^ is the dominating 
term in the expression for p when y is small and p > 0, 
we must have ai > and so ai = 2. Next the coefficient 
of y'^ shows that a2 = — (16g + 4)/6. In general, we get 
equations in the which can be used to determine the 
recursively. The recursions seems to be quite complicated in 
general and we have not found an explicit general expression 
for ai. Substituting the expression for p into h{p) and taking 
Taylor expansion, we get the expression for /i(d, k). □ 

Remark. We do not know when the infinite series X^i^i '^iV^ 
and YlTLi biy'' converge (this may depend on q and p). 



Example 1: Consider q = 2, d = 1000 and k = 2. 
Solving h'{p) = numerically, we get p « 0.0352540 and 
/Lt(1000, 2) w 2075.8565430. Taking one, two, three, and four 
terms respectively in the expression for /i(1000, 2) in Theorem 
n] we get the following approximations: 



no. of terms 


value 


1 


2000 


2 


2074.4659482 


3 


2075.8522426 


4 


2075.8565439 



IV. Approximations for ^i{d, k) when fc > d 
Theorem 2: Assume that k ^ oo and d/k ^ 0. Let 

K = k ln(g) — ln(q — 1), A = ln((7), 

e = d/K, A = ln(0A/((7 - 1)) 

There exist polynomials Ai{x) and Bi{x) for i = 0, 1, 2, . . 
such that, for any r > 0, 



1 



A 



and 



QPm = —J + 0(^+1), 



The first few Ai{x) and Bi{x) are given by the following 
table: 

Ai{x) = 1, 

^2(2;) = 2; + A-l, 

^3(2;) = (2a;^ + (4A-2)a; + 2A2-3A)/2, 

Bo{x) = 1, 

-Bi(a:) = -{x-l), 

B2{x) = -{2x-X + 2)/2, 

Bsix) = -{3x^ + 3Xx + X^ 
Proof: Let r/ = q—l and tt = 1 — Qp. Then 1— p = (l + 777r)/(j 
and Q{1 — p) = (1 + rji:)/!]. Hence 

G(7r,0) 



where 



(l-p)(l-Qp)ln(l-p)2' 

On ln(7r) — TT. 

) = 0. 
hi{q) and tt ln(7r) 



^ 1 + r77r , /l + Tm 
-0 In' 



77 \ q 

Therefore, h'{p) = if and only if G{-k, 6) = Q 

If TT ^ 0+, then ln((l + ?y7r)/g) 
0. Hence, for small tt. 







G{il,e) ln(q) 



Therefore, tt « ^A/t]. We write tt = 6'A(1 + y)/rj (where y 
will depend on 6). This implies that 

ln(7r) = A + ln(l + y). 



Hence, if g{y) = G'(7r, 6) we get 

g(y) = ^{-(l + 0A(l + y))ln(l + 0A(l+t/)) 

+ (l + 0A(l + y))A 
+0A(l + 2/)(A + ln(l + 2/)) 
-A(l + y)}. 

We now write y = aj(A)6'*+0(6'''+^). Formally treating 

A as if it were a constant, we can take the Taylor expansion 
of giy) in terms of 9 and we get an expansion of the form 
YlTLi Ci^*' where the c; are polynomials of A. Since these 
polynomials Ci must be identically zero, we get equations to 
determine the polynomials Ai{x). Substituting the series of 
into h{p) and taking the Taylor expansion, we get the series 
of fi{d,k). □ 

Remark. To formally justify that we treat A as if it were a 
constant, we should prove that A is algebraically independent 
of the other quantities involved, that is, there is no non- 
trivial polynomial equation in A with coefficients expressed 
as rational functions of the remaining quantities. We have not 
done this, but it highly likely that it is true. 

As to convergence, the situation is similar to Theorem ^ 
Example 2: Consider q — 2, d — 2 and k — 1000. 
Solving h'(p) — numerically, we get p « 0.4990185 and 
fi{2, 1000) w 1020.8737393. Taking one, two, three, and four 
terms respectively in the expression for /i(2, 1000) in Theorem 
|2] we get the following approximations: 



no. of terms 


value 


1 


1000 


2 


1020.8169587 


3 


1020.8741383 


4 


1020.8737362 



V. Non-linear codes 

To get results for a non-linear code C, we have to do some 
modifications. In stead of the weight distribution, we let {Ai} 
denote the distance distribution (for linear codes these are the 
same). Then equation (|2} is still valid for non-linear codes. 

Non-linear codes do not have dual codes so results like 
(|3} and Lemma ^ do not make sense in the non-linear case. 
However, if we let k — log^(|C|) and define 

n 
i=0 

this is a welldefined function, and for linear codes 
Puc{C-^,p) = P^^{C,p). Moreover, using P^(C, tt) in stead 
of Puo(C^,7r) and Pje(C,p) in stead of Pue(C-L,p) in the 
proof of Lemmas and |2l we get the following result. 

Lemma 3: If C is a bad code of length n and size M, and 
then 



P^,{C,7t)>1/M. 
From this we get a lemma similar to Lemma |2 However, 
whereas for linear codes we have Ad > q — I, for non-linear 



codes we only know in general that Ad > 1/Af. Hence we 
get the following weaker version of Lemma |2] in general. 
Lemma 4: If C is a code of length n and size M, and 

rfln(l-Qp)-21og^(Af)ln(g) 



n > hN{p) 



(5) 



In(l-p) 
then C is bad. 

The analysis of the cases d k and k dto find the best 
choice of p is similar to the linear case, the only difference is 
that for the general case k = 2fcln(g). With this modification. 
Theorems and |2l are valid also for non-linear codes. Let 
liM{d,k) denote the minimum of h^ip) (for k — 2khi{q)) 
and let pjv be the value of p that gives this minimum. 

Theorem 3: Assume that d oo and k/d ^ 0. Let 



K = 2k \n{q) and y 
For any r > 0, 



2dQiQ-l)' 



and 



Mat 



i=l 



(d, k)^dQ + 2dQ{Q - 1) ^ hy' + 0(2/"+i). 



where and hi are the numbers given in Theorem [2 



Theorem 4: Assume that fc ^ oo and d/k ^ 0. Let 

K = 2k\ii{q), A = ln(g), 

9 = d/n, A = ln{eX/{q - 1)) 



For any r > 0, 



1 



-^A,(A)0' + 0(^+1) 



and 



i=0 



where Ai{x) and Bi{x) are the polynomials given in Theorem 

m 

Example 3: For the values of d and /c (and q — 2) consid- 
ered in the examples for linear codes, we get the following 



approximate values 


for the 


bounds: 




d 


k 


H{d,k) 


UNid, k) 


1000 


2 


2075.86 


2108.10 


2 


1000 


1020.87 


2022.85 



This illustrate the results we have found that for large d, 
IJ.{d,k) and iJL{d,k)M are both approximately 2d (taking only 
the first term of the approximation) and for large fc, ^{d^ k) 
is approximately k whereas ii{d,k)M is approximately 2k 
(reflecting the fact that k, is approximately twice as large in 
the non-linear case). 
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